SEQUENCE LISTING 

SEQ ID NO:l Synthetic Sso7d gene 

GCAACCGTAAAGTTCAAGTACAAAGGCGAAGAAAAAGAGGTAGACATCTCCAA 
5 GATCAAGAAAGTATGGCGTGTGGGCAAGATGATCTCCTTCACCTACGACGAGGG 
CGGTGGCAAGACCGGCCGTGGTGCGGTAAGCGAAAAGGACGCGCCGAAGGAGC 
TGCTGCAGATGCTGGAGAAG 
CAGAAAAAG 

10 SEQIDNO:2 The amino acid sequence of Sso7d. 

ATVKFKYKGEEKEVDISKIKKVWRVGKMISFTYDEGGGKTGRGAVSEKDAPKELLQ 
MLEKQKK 

SEQ ID NO:3 The DNA sequence encoding the Sso7d-ATaq fusion protein 

^ 15 ATGATTACGAATTCGAGCGCAACCGTAAAGTTCAAGTACAAAGGCGAAGAAAAA 
p: GAGGTAGACATCTCCAAGATCAAGAAAGTATGGCGTGTGGGCAAGATGATCTCC 
m TTCACCTACGACGAGGGCGGTGGCAAGACCGGCCGTGGTGCGGTAAGCGAAAAG 
GACGCGCCGAAGGAGCTGCTGCAGATGCTGGAGAAGCAGAAAAAGGGCGGCGG 
g TGTCACTAGTCCCAAGGCcCTGGAGGAGGCCCCCTGGCCCCCGCCGGAAGGGGCC 
y 20 TTCGTGGGCTTTGTGCTTTCCCGCAAGGAGCCCATGTGGGCCGATCTTCTGGCCCT 
y GGCCGCCGCCAGGGGGGGCCGGGTCCACCGGGCCCCCGAGCCTTATAAAGCCCT 
H CAGGGACCTGAAGGAGGCGCGGGGGCTTCTCGCCAAAGACCTGAGCGTTCTGGC 
CCTGAGGGAAGGCCTTGGCCTCCCGCCCGGCGACGACCCCATGCTCCTCGCCTAC 
CTCCTGGACCCTTCCAACACCACCCCCGAGGGGGTGGCCCGGCGCTACGGCGGG 
25 GAGTGGACGGAGGAGGCGGGGGAGCGGGCCGCCCTTTCCGAGAGGCTCTTCGCC 
AACCTGTGGGGGAGGCTTGAGGGGGAGGAGAGGCTCCTTTGGCTTTACCGGGAG 
GTGGAGAGGCCCCTTTCCGCTGTCCTGGCCCACATGGAGGCCACGGGGGTGCGC 
CTGGACGTGGCCTATCTCAGGGCCTTGTCCCTGGAGGTGGCCGAGGAGATCGCCC 
GCCTCGAGGCCGAGGTCTTCCGCCTGGCCGGCCACCCCTTCAACCTCAACTCCCG 
30 GGACCAGCTGGAAAGGGTCCTCTTTGACGAGCTAGGGCTTCCCGCCATCGGCAA 
GACGGAGAAGACCGGCAAGCGCTCCACCAGCGCCGCCGTCCTGGAGGCCCTCCG 
CGAGGCCCACCCCATCGTGGAGAAGATCCTGCAGTACCGGGAGCTCACCAAGCT 
GAAGAGCACCTACATTGACCCCTTGCCGGACCTCATCCACCCCAGGACGGGCCG 
CCTCCACACCCGeXTCAACCAGACGGCCACGGCCACGGGCAGGCTAAGTAGCTC 
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CGATCCCAACCTCCAGAACATCCCCGTCCGCACCCCGCTTGGGCAGAGGATCCGC 
CGGGCCTTCATCGCCGAGGAGGGGTGGCTATTGGTGGCCCTGGACTATAGCCAG 
ATAGAGCTCAGGGTGCTGGCCCACCTCTCCGGCGACGAGAACCTGATCCGGGTCT 
TCCAGGAGGGGCGGGACATCCACACGGAGACCGCCAGCTGGATGTTCGGCGTCC 
5 CCCGGGAGGCCGTGGACCCCCTGATGCGCCGGGCGGCCAAGACCATCAACTTCG 
GGGTCCTCTACGGCATGTCGGCCCACCGCCTCTCCCAGGAGCTAGCCATCCCTTA 
CGAGGAGGCCCAGGCCTTCATTGAGCGCTACTTTCAGAGCTTCCCCAAGGTGCGG 
GCCTGGATTGAGAAGACCCTGGAGGAGGGCAGGAGGCGGGGGTACGTGGAGAC 
CCTCTTCGGCCGCCGCCGCTACGTGCCAGACCTAGAGGCCCGGGTGAAGAGCGT 
1 0 GCGGG AGGCGGCCG AGCGC ATGGCCTTC AAC ATGCCCGTCC AGGGC ACCGCCGC 
CGACCTCATGAAGCTGGCTATGGTGAAGCTCTTCCCCAGGCTGGAGGAAATGGG 
GGCCAGGATGCTCCTTCAGGTCCACGACGAGCTGGTCCTCGAGGCCCCAAAAGA 
5 GAGGGCGGAGGCCGTGGCCCGGCTGGCCAAGGAGGTCATGGAGGGGGTGTATCC 
m CCTGGCCGTGCCCCTGGAGGTGGAGGTGGGGATAGGGGAGGACTGGCTCTCCGC 
M 15 CAAGGAGGGCATTGATGGCCGCGGCGGAGGCGGGCATCATCATCATCATCATTA 
W A 

13 SEQ ID NO:4 The amino acid sequence of Sso7d-ATaq fusion protein 

^ MITNSSATVKFKYKGEEKEVDISKIKKVWRVGKMISFTYDEGGGKTGRGAVSEKDA 
Q 20 PKELLQMLEKQKKGGGVTSPKALEEAPWPPPEGAFVGFVLSRKEPMWADLLALAA 
y ARGGRVHRAPEPYKALRDLKE ARGLLAKDLS VLALREGLGLPPGDDFMLLA\T:.LDP 

SNTTPEGVARRYGGEWTEEAGERAALSERLFANLWGRLEGEERLLWLYREVERPLS 
AVLAHMEATGVRLDVAYLRALSLEVAEEIARLEAEVFRLAGHPFNLNSRDQLERVLF 
DELGLPAIGKTEKTGKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLIH 
25 PRTGRLHTRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVALDYS 
QIELRVLAHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFGV 
LYGMSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGRR 
RYVPDLEARVKSWEAAERMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARMLL 
QVHDELVLEAPKERAEAVARLAKEVMEGVYPLAVPLEVEVGIGEDWLSAKEGIDGR 
30 GGGGHHHHHH 

SEQ ID NO:5 The DNA sequence encoding the Sso7d-Taq fusion protein 

ATGATTACGAATTCGAGCGCAACCGTAAAGTTCAAGTACAAAGGCGAAGAAAAA 
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GAGGTAGACATCTCCAAGATCAAGAAAGTATGGCGTGTGGGCAAGATGATCTCC 
TTCACCTACGACGAGGGCGGTGGCAAGACCGGCCGTGGTGCGGTAAGCGAAAAG 
GACGCGCCGAAGGAGCTGCTGCAGATGCTGGAGAAGCAGAAAAAGGGCGGCGG 
TGTCACTAGTGGGATGCTGCCCCTCTTTGAGCCCAAGGGCCGGGTCCTCCTGGTG 
5 GACGGCCACCACCTGGCCTACCGCACCTTCCACGCCCTGAAGGGCCTCACCACCA 
GCCGGGGGGAGCCGGTGCAGGCGGTCTACGGCTTCGCCAAGAGCCTCCTCAAGG 
CCCTCAAGGAGGACGGGGACGCGGTGATCGTGGTCTTTGACGCCAAGGCCCCCT 
CCTTCCGCCACGAGGCCTACGGGGGGTACAAGGCGGGCCGGGCCCCCACGCCAG 
AGGACTTTCCCCGGCAACTCGCCCTCATCAAGGAGCTGGTGGACCTCCTGGGGCT 
1 0 GGCGCGCCTCGAGGTCCCGGGCTACGAGGCGG ACG ACGTCCTGGCC AGCCTGGC 
CAAGAAGGCGGAAAAGGAGGGCTACGAGGTCCGCATCCTCACCGCCGACAAAG 
ACCTTTACCAGCTCCTTTCCGACCGCATCCACGTCCTCCACCCCGAGGGGTACCT 
CATCACCCCGGCCTGGCTTTGGGAAAAGTACGGCCTGAGGCCCGACCAGTGGGC 
CGACTACCGGGCCCTGACCGGGGACGAGTCCGACAACCTTCCCGGGGTCAAGGG 
! 15 CATCGGGGAGAAGACGGCGAGGAAGCTTCTGGAGGAGTGGGGGAGCCTGGAAG 
CCCTCCTCAAGAACCTGGACCGGCTGAAGCCCGCCATCCGGGAGAAGATCCTGG 
CCCACATGGACGATCTGAAGCTCTCCTGGGACCTGGCCAAGGTGCGCACCGACCT 
GCCCCTGGAGGTGGACTTCGCCAAAAGGCGGGAGCCCGACCGGGAGAGGCTTAG 
GGCCTTTCTGGAGAGGCTTGAGTTTGGCAGCCTCCTCCACGAGTTCGGCCTTCTG 
20 GAAAGCCCCAAGGCcCTGGAGGAGGCCCCCTGGCCCCCGCCGGAAGGGGCCTTC 
GTGGGCTTTGTGCTTTCCCGCAAGGAGCCCATGTGGGCCGATCTTCTGGCCCTGG 
CCGCCGCCAGGGGGGGCCGGGTCCACCGGGCCCCCGAGCCTTATAAAGCCCTCA 
GGGACCTGAAGGAGGCGCGGGGGCTTCTCGCCAAAGACCTGAGCGTTCTGGCCC 
TGAGGGAAGGCCTTGGCCTCCGGCCCGGCGACGACCCCATGCTCCTCGCCTACCT 
25 CCTGGACCCTTCCAACACCACCCCCGAGGGGGTGGCCCGGCGCTACGGCGGGGA 
GTGGACGGAGGAGGCGGGGGAGCGGGCCGCCCTTTCCGAGAGGCTCTTCGCCAA 
CCTGTGGGGGAGGCTTGAGGGGGAGGAGAGGCTCCTTTGGCTTTACCGGGAGGT 
GGAGAGGCCCCTTTCCGCTGTCCTGGCCCACATGGAGGCCACGGGGGTGCGCCT 
GGACGTGGCCTATCTCAGGGCCTTGTCCCTGGAGGTGGCCGAGGAGATCGCCCG 
30 CCTCGAGGCCGAGGTCTTCCGCCTGGCCGGCCACCCCTTCAACCTCAACTCCCGG 
GACCAGCTGGAAAGGGTCCTCTTTGACGAGCTAGGGCTTCCCGCCATCGGCAAG 
ACGGAGAAGACCGGCAAGCGCTCCACCAGCGCCGCCGTCCTGGAGGCCCTCCGC 
GAGGCCCACCCCATCGTGGAGAAGATCCTGCAGTACCGGGAGCTCACCAAGCTG 
AAGAGCACCTACATTGACCCCTTGCCGGACCTCATCCACCCCAGGACGGGCCGCC 
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TCCACACCCGCTTCAACCAGACGGCCACGGCCACGGGCAGGCTAAGTAGCTCCG 
ATCCCAACCTCCAGAACATCCCCGTCCGCACCCCGCTTGGGCAGAGGATCCGCCG 
GGCCTTCATCGCCGAGGAGGGGTGGCTATTGGTGGCCCTGGACTATAGCCAGAT 
AGAGCTCAGGGTGCTGGCCCACCTCTCCGGCGACGAGAACCTGATCCGGGTCTTC 
5 CAGGAGGGGCGGGACATCCACACGGAGACCGCCAGCTGGATGTTCGGGGTCCCC 
CGGGAGGCCGTGGACCCCCTGATGCGCCGGGCGGCCAAGACCATCAACTTCGGG 
GTCCTCTACGGCATGTCGGCCCACCGCCTCTCCCAGGAGCTAGCCATCCCTTACG 
AGGAGGCCCAGGCCTTCATTGAGCGCTACTTTCAGAGCTTCCCCAAGGTGCGGGC 
CTGGATTGAGAAGACCCTGGAGGAGGGCAGGAGGCGGGGGTACGTGGAGACCC 
1 0 TCTTCGGCCGCCGCCGCTACGTGCCAGACCT AGAGGCCCGGGTG AAGAGCGTGC 
GGGAGGCGGCCGAGCGCATGGCCTTCAACATGCCCGTCCAGGGCACCGCCGCCG 
ACCTCATGAAGCTGGCTATGGTGAAGCTCTTCCCCAGGCTGGAGGAAATGGGGG 
CCAGGATGCTCCTTCAGGTCCACGACGAGCTGGTCCTCGAGGCCCCAAAAGAGA 
GGGCGGAGGCCGTGGCCCGGCTGGCCAAGGAGGTCATGGAGGGGGTGTATCCCC 



Q 15 TGGCCGTGCCCCTGGAGGTGGAGGTGGGGATAGGGGAGGACTGGCTCTCCGCCA 



AGGAGGGC ATTGATGGCCGCGGCGGAGGCGGGC ATCATCATCATCATCATTAA 

Q SEQ ID NO:6 The amino acid sequence of Sso7d-Taq fusion protein. 

g MITNSSATVKFKYKGEEKEVDISKIKKVWRVGKMISFTYDEGGGKTGRGAVSEKDA 
2 20 PKELLQMLEKQKKGGGVTSGMLPLFEPKGRVLLVDGHHLAYRTFHALKGLTTSRGE 
P PVQAVYGFAKSLLKALKEDGDAVIVVFDAKAPSFRHEAYGGYKAGRAPTPEDFPRQ 
LALIKELVDLLGLARLEVPGYEADDVLASLAKKAEKEGYEVRILTADKDLYQLLSDR 
IHVLHPEGYLITPAWLWEKYGLRPDQWADYRALTGDESDNLPGVKGIGEKTARKLL 
EEWGSLEALLKNLDRLKPAIREKILAHMDDLKLSWDLAKVRTDLPLEVDFAKRREP 
25 DRERLRAFLERLEFGSLLHEFGLLESPKALEEAPWPPPEGAFVGFVLSRKEPMWADL 
LALAAARGGRVHRAPEPYKALRDLKEARGLLAKDLSVLALREGLGLPPGDDPMLLA 
YLLDPSNTTPEGVARRYGGEWTEEAGERAALSERLFANLWGRLEGEERLLWLYREV 
ERPLSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLEAEVFRLAGHPFNLNSRDQL 
ERVLFDELGLPAIGKTEKTGKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPL 
30 PDLIHPRTGRLHTRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVA 
LDYSQIELRVLAHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTI 
NFGVLYGMSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETL 
FGRRRYVPDLEARVKSVREAAERMAFNMPVQGTAADLMKLAMVKLFPRLEEMGA 
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# 



RMLLQVHDELVLEAPKERAEAVARLAKEVMEGVYPLAVPLEVEVGIGEDWLSAKE 
GIDGRGGGGHHHHHH 

SEQ ID NO:7 The DNA sequence encoding the Pfu-Sso7d fusion protein 

5 ATGATTTTAGATGTGGATTACATAACTGAAGAAGGAAAACCTGTTATTAGGCTAT 
TCAAAAAAGAGAACGGAAAATTTAAGATAGAGCATGATAGAACTTTTAGACCAT 
ACATTTACGCTCTTCTCAGGGATGATTCAAAGATTGAAGAAGTTAAGAAAATAAC 
GGGGGAAAGGCATGGAAAGATTGTGAGAATTGTTGATGTAGAGAAGGTTGAGAA 
AAAGTTTCTCGGCAAGCCTATTACCGTGTGGAAACTTTATTTGGAACATCCCCAA 

1 0 GATGTTCCCACTATT AGAGAAAAAGTTAGAGAAC ATCC AGCAGTTGTGGAC ATCT 
TCGAATACGATATTCCATTTGCAAAGAGATACCTCATCGACAAAGGCCTAATACC 
AATGGAGGGGGAAGAAGAGCTAAAGATTCTTGCCTTCGATATAGAAACCCTCTA 
TCACGAAGGAGAAGAGTTTGGAAAAGGCCCAATTATAATGATTAGTTATGCAGA 
TGAAAATGAAGCAAAGGTGATTACTTGGAAAAACATAGATCTTCCATACGTTGA 

1 5 GGTTGTATCAAGCGAGAGAGAGATGATAAAGAGATTTCTC AGGATTATC AGGGA 
GAAGGATCCTGACATTATAGTTACTTATAATGGAGACTCATTCGACTTCCCATAT 
TTAGCGAAAAGGGCAGAAAAACTTGGGATTAAATTAACCATTGGAAGAGATGGA 
AGCGAGCCCAAGATGCAGAGAATAGGCGATATGACGGCTGTAGAAGTCAAGGG 
AAGAATACATTTCGACTTGTATCATGTAATAACAAGGACAATAAATCTCCCAACA 

20 TACACACTAGAGGCTGTATATGAAGCAATTTTTGGAAAGCCAAAGGAGAAGGTA 
TACGCCGACGAGATAGCAAAAGCCTGGGAAAGTGGAGAGAACCTTGAGAGAGTT 
GCCAAATACTCGATGGAAGATGCAAAGGCAACTTATGAACTCGGGAAAGAATTC 
CTTCCAATGGAAATTCAGCTTTCAAGATTAGTTGGACAACCTTTATGGGATGTTT 
CAAGGTCAAGCACAGGGAACCTTGTAGAGTGGTTCTTACTTAGGAAAGCCTACG 

25 AAAGAAACGAAGTAGCTCCAAACAAGCCAAGTGAAGAGGAGTATCAAAGAAGG 
CTCAGGGAGAGCTACACAGGTGGATTCGTTAAAGAGCCAGAAAAGGGGTTGTGG 
GAAAACATAGTATACCTAGATTTTAGAGCCCTATATCCCTCGATTATAATTACCC 
ACAATGTTTCTCCCGATACTCTAAATCTTGAGGGATGCAAGAACTATGATATCGC 
TCCTCAAGTAGGCCACAAGTTCTGCAAGGACATCeCTGGTTTTATACCAAGTCTC 

30 TTGGGACATTTGTTAGAGGAAAGACAAAAGATTAAGACAAAAATGAAGGAAACT 
CAAGATCCTATAGAAAAAATACTCCTTGACTATAGACAAAAAGCGATAAAACTC 
TTAGCAAATTCTTTCTACGGATATTATGGCTATGCAAAAGCAAGATGGTACTGTA 
AGGAGTGTGCTGAGAGCGTTACTGCCTGGGGAAGAAAGTACATCGAGTTAGTAT 
GGAAGGAGCTCGAAGAAAAGTTTGGATTTAAAGTCCTCTACATTGACACTGATG 
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GTCTCTATGCAACTATCCCAGGAGGAGAAAGTGAGGAAATAAAGAAAAAGGCTC 
TAGAATTTGTAAAATACATAAATTCAAAGCTCCCTGGACTGCTAGAGCTTGAATA 
TGAAGGGTTTTATAAGAGGGGATTCTTCGTTACGAAGAAGAGGTATGCAGTAAT 
AGATGAAGAAGGAAAAGTCATTACTCGTGGTTTAGAGATAGTTAGGAGAGATTG 
5 GAGTGAAATTGCAAAAGAAACTCAAGCTAGAGTTTTGGAGACAATACTAAAACA 
CGGAGATGTTGAAGAAGCTGTGAGAATAGTAAAAGAAGTAATACAAAAGCTTGC 
CAATTATGAAATTCCACCAGAGAAGCTCGCAATATATGAGCAGATAACAAGACC 
ATTACATGAGTATAAGGCGATAGGTCCTCACGTAGCTGTTGCAAAGAAACTAGCT 
GCTAAAGGAGTTAAAATAAAGCCAGGAATGGTAATTGGATACATAGTACTTAGA 
1 0 GGCGATGGTCCAATTAGCAATAGGGCAATTCTAGCTGAGGAATACGATCCCAAA 
AAGCACAAGTATGACGCAGAATATTACATTGAGAACCAGGTTCTTCCAGCGGTA 
CTTAGGATATTGGAGGGATTTGGATACAGAAAGGAAGACCTCAGATACCAAAAG 
y ACAAGACAAGTCGGCCTAACTTCCTGGCTTAACATTAAAAAATCCGGTACCGGC 
^ GGTGGCGGTGCAACCGTAAAGTTCAAGTACAAAGGCGAAGAAAAAGAGGTAGA 
d 15 CATCTCCAAGATCAAGAAAGTATGGCGTGTGGGCAAGATGATCTCCTTCACCTAC 
^ GACGAGGGCGGTGGCAAGACCGGCCGTGGTGCGGTAAGCGAAAAGGACGCGCC 
W GAAGGAGCTGCTGCAGATGCTGGAGAAGCAGAAAAAGTGA 

Ln 

r ; SEQ ID NO:8 The amino acid sequence of the Pfu-Sso7d fusion protein 

O 20 MILDVDYITEEGKPVIRLFKKENGKFKIEHDRTFRPYIYALLRDDSKIEEVKKITGERH 
2 GKIVRIVDVEKVEKXFLGKPITVIVKLYLEHPQDVPTIREKVREKPAWDIFEYDIPFA 
KRYLIDKGLIPMEGEEELKILAFDIETLYHEGEEFGKGPIIMISYADENEAKVITWKNID 
LPYVEWSSEREMIKRFLRIIREKDPDIIVTYNGDSFDFPYLAKRAEKLGIKLTIGRDGS 
EPKMQRIGDMTAVEVKGRJHFDLYHVITRTINLPTYTLEAVYEAIFGKPKEKVYADEI 
25 AKAWESGENLERVAKYSMEDAKATYELGKEFLPMEIQLSRLVGQPLWDVSRSSTGN 
LVEWFLLRKAYERNEVAPNKPSEEEYQRRLRESYTGGFVKEPEKGLWENIVYLDFR 
ALYPSIIITHNVSPDTLNLEGCKNYDIAPQVGHKFCKDIPGFIPSLLGHLLEERQKIKTK 
MKETQDPIEKILLDYRQKAIKiLAJNlSFYGYYGYAKARWYCKECAESVTAWGRKYIE 
LVWKELEEKFGFKVLYIDTDGLYATIPGGESEEIKKKALEFVKYINSKLPGLLELEYE 
30 GFYKRGFFVTKKRYAVIDEEGKVITRGLEIVRRDWSEIAKETQARVLETILKHGDVEE 
AVWVKEVIQKLANYEIPPEKLAIYEQITRPLHEYKAIGPHVAVAKKLAAKGVKIKPG 
MVIGYIVLRGDGPISNRAILAEEYDPKKHKYDAEYYIENQVLPAVLRILEGFGYRKED 
LRYQKTRQVGLTSWLNIKKSGTGGGGATVKFKYKGEEKEVDISKIKKVWRVGKMIS 
FTYDEGGGKTGRGAVSEKDAPKELLQMLEKQKK 
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SEQ ID NO:9 The DNA sequence encoding the Sac7d-ATaq fusion protein 

ATGATTACGAATTCGACGGTGAAGGTAAAGTTCAAGTATAAGGGTGAAGAGAAA 
GAAGTAGACACTTCAAAGATAAAGAAGGTTTGGAGAGTAGGCAAAATGGTGTCC 
5 TTTACCTATGACGACAATGGTAAGACAGGTAGAGGAGCTGTAAGCGAGAAAGAT 
GCTCCAAAAGAATTATTAGACATGTTAGCAAGAGCAGAAAGAGAGAAGAAAGG 
CGGCGGTGTCACTAGTCCCAAGGCCCTGGAGGAGGCCCCCTGGCCCCCGCCGGA 
AGGGGCCTTCGTGGGCTTTGTGCTTTCCCGCAAGGAGCCCATGTGGGCCGATCTT 
CTGGCCCTGGCCGCCGCCAGGGGGGGCCGGGTCCACCGGGCCCCCGAGCCTTAT 
1 0 AAAGCCCTC AGGGACCTGAAGGAGGCGCGGGGGCTTCTCGCCAAAGACCTGAGC 
GTTCTGGCCCTGAGGGAAGGCCTTGGCCTCCCGCCCGGCGACGACCCCATGCTCC 
TCGCCTACCTCCTGGACCCTTCCAACACCACCCCCGAGGGGGTGGCCCGGCGCTA 
Q CGGCGGGGAGTGGACGGAGGAGGCGGGGGAGCGGGCCGCCCTTTCCGAGAGGC 
m TCTTCGCCAACCTGTGGGGGAGGCTTGAGGGGGAGGAGAGGCTCCTTTGGCTTTA 
15 CCGGGAGGTGGAGAGGCCCCTTTCCGCTGTCCTGGCCCACATGGAGGCCACGGG 

LJ 

W GGTGCGCCTGGACGTGGCCTATCTCAGGGCCTTGTCCCTGGAGGTGGCCGAGGA 
y GATCGCCCGCCTCGAGGCCGGGTCTTCCGCCTGGCCGGCCACCCCTTCAACCTCA 
L ACTCCCGGGACCAGCTGGAAAGGGTCCTCTTTGACGAGCTAGGGCTTCCCGCCAT 
m CGGCAAGACGGAGAAGACCGGCAAGCGCTCCACCAGCGCCGCCGTCCTGGAGGC 
20 CCTCCGCGAGGCCCACCCCATCGTGGAGAAGATCCTGCAGTACCGGGAGCTCAC 



CAAGCTGAAGAGCACCTACATTGACCCCTTGCCGGACCTCATCCACCCCAGGACG 
GGCCGCCTCCACACCCGCTTCAACCAGACGGCCACGGCCACGGGCAGGCTAAGT 
AGCTCCGATCCCAACCTCCAGAACATCCCCGTCCGCACCCCGCTTGGGCAGAGGA 
TCCGCCGGGCCTTCATCGCCGAGGAGGGGTGGCTATTGGTGGCCCTGGACTATAG 

25 CCAGATAGAGCTCAGGGTGCTGGCCCACCTCTCCGGCGACGAGAACCTGATCCG 
GGTCTTCCAGGAGGGGCGGGACATCCACACGGAGACCGCCAGCTGGATGTTCGG 
CGTCCCCCGGGAGGCCGTGGACCCCCTGATGCGCCGGGCGGCCAAGACCATCAA 
CTTCGGGGTCCTCTACGGCATGTCGGCCCACCGCCTCTCCCAGGAGCTAGCCATC 
CCTTACGAGGAGGCCCAGGCCTTCATTGAGCGCTACTTTCAGAGCTTCCCCAAGG 

30 TGCGGGCCTGGATTGAGAAGACCCTGGAGGAGGGCAGGAGGCGGGGGTACGTG 
GAGACCCTCTTCGGCCGCCGCCGCTACGTGCCAGACCTAGAGGCCCGGGTGAAG 
AGCGTGCGGGAGGCGGCCGAGCGCATGGCCTTCAACATGCCCGTCCAGGGCACC 
GCCGCCGACCTCATGAAGCTGGCTATGGTGAAGCTCTTCCCCAGGCTGGAGGAA 
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ATGGGGGCCAGGATGCTCCTTCAGGTCCACGACGAGCTGGTCCTCGAGGCCCCA 
AAAGAGAGGGCGGAGGCCGTGGCCCGGCTGGCCAAGGAGGTCATGGAGGGGGT 
GTATCCCCTGGCCGTGCCCCTGGAGGTGGAGGTGGGGATAGGGGAGGACTGGCT 
CTCCGCCAAGGAGGGCATTGATGGCCGCGGCGGAGGCGGGCATCATCATCATCA 
5 TCATTAA 

SEQ ID NO: 10 The amino acid sequence of the Sac7d-ATaq fusion protein 

MITNSTVKVKFKYKGEEKEVDTSKIKKVWRVGKMVSFTYDDNGKTGRGAVSEKDA 
PKELLDMLARAEREKKGGGVTSPKALEEAPWPPPEGAFVGFVLSRKEPMWADLLAL 
1 0 AAARGGRVHRAPEPYKALRDLKEARGLLAKDLS VLALREGLGLPPGDDPMLLAYLL 
DPSNTTPEGVARRYGGEWTEEAGERAALSERLFANLWGRLEGEERLLWLYREVERP 
LSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLEAEVFRLAGHPFNLNSRDQLERV 
Q LFDELGLPAIGKTEKTGKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLI 
5 HPRTGRLHTRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVALDY 
N 15 SQIELRVLAHLSGDENLIRVFQEGRDIHTETASWMFGVPREAVDPLMRRAAKTINFG 
y VLYGMSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYVETLFGR 
RRYWDLEARWSX^AAERMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARML 
LQVHDELVLEAPKERAEAVARLAKEVMEGVYPLAVPLEVEVGIGEDWLSAKEGIDG 
S RGGGGHHHHHH 

y 20 

Q 

O SEQ ID NO: 11 The DNA sequence encoding the PL-ATaq fusion protein 

ATGATTACGAATTCGAAGAAAAAGAAAAAGAAAAAGCGTAAGAAACGCAAAAA 
GAAAAAGAAAGGCGGCGGTGTCACTAGTGGCGCAACCGTAAAGTTCAAGTACAA 
AGGCGAAGAAAAAGAGGTAGACATCTCCAAGATCAAGAAAGTATGGCGTGTGG 

25 GCAAGATGATCTCCTTCACCTACGACGAGGGCGGTGGCAAGACCGGCCGTGGTG 
CGGTAAGCGAAAAGGACGCGCCGAAGGAGCTGCTGCAGATGCTGGAGAAGCAG 
AAAAAGGGCGGCGGTGTCACCAGTCCCAAGGCCCTGGAGGAGGCCCCCTGGCCC 
CCGCCGGAAGGGGCCTTCGTGGGCTTTGTGCTTTCCCGCAAGGAGCCCATGTGGG 
CCGATCTTCTGGCCCTGGCCGCCGCCAGGGGGGGCCGGGTCCACCGGGCCCCCG 

30 AGCCTTATAAAGCCCTCAGGGACCTGAAGGAGGCGCGGGGGCTTCTCGCCAAAG 
ACCTGAGCGTTCTGGCCCTGAGGGAAGGCCTTGGCCtCCCGCCCGGCGACGACCC 
CATGCTCCTCGCCTACCTCCTGGACCCTTCCAACACCACCCCCGAGGGGGTGGCC 
CGGCGCTACGGCGGGGAGTGGACGGAGGAGGCGGGGGAGCGGGCCGCCCTTTCC 
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GAGAGGCTCTTCGCCAACCTGTGGGGGAGGCTTGAGGGGGAGGAGAGGCTCCTT 
TGGCTTTACCGGGAGGTGGAGAGGCCCCTTTCCGCTGTCCTGGCCCACATGGAGG 
CCACGGGGGTGCGCCTGGACGTGGCCTATCTCAGGGCCTTGTCCCTGGAGGTGGC 
CGAGGAGATCGCCCGCCTCGAGGCCGAGGTCTTCCGCCTGGCCGGCCACCCCTTC 
5 AACCTCAACTCCCGGGACCAGCTGGAAAGGGTCCTCTTTGACGAGCTAGGGCTTC 
CCGCCATCGGCAAGACGGAGAAGACCGGCAAGCGCTCCACCAGCGCCGCCGTCC 
TGGAGGCCCTCCGCGAGGCCCACCCCATCGTGGAGAAGATCCTGCAGTACCGGG 
AGCTCACCAAGCTGAAGAGCACCTACATTGACCCCTTGCCGGACCTCATCCACCC 
CAGGACGGGCCGCCTCCACACCCGCTTCAACCAGACGGCCACGGCCACGGGCAG 
1 0 GCTAAGTAGCTCCGATCCCAACCTCCAGAACATCCCCGTCCGC ACCCCGCTTGGG 
CAGAGGATCCGCCGGGCCTTCATCGCCGAGGAGGGGTGGCTATTGGTGGCCCTG 
GACTATAGCCAGATAGAGCTCAGGGTGCTGGCCCACCTCTCCGGCGACGAGAAC 
P CTGATCCGGGTCTTCCAGGAGGGGCGGGACATCCACACGGAGACCGCCAGCTGG 
m ATGTTCGGCGTCCCCCGGGAGGCCGTGGACCCCCTGATGCGCCGGGCGGCCAAG 
Q 15 ACCATCAACTTCGGGGTCCTCTACGGCATGTCGGCCCACCGCCTCTCCCAGGAGC 
W TAGCCATCCCTTACGAGGAGGCCCAGGCCTTCATTGAGCGCTACTTTCAGAGCTT 
y CCCCAAGGTGCGGGCCTGGATTGAGAAGACCCTGGAGGAGGGCAGGAGGCGGG 
t. GGTACGTGGAGACCCTCTTCGGCCGCCGCCGCTACGTGCCAGACCTAGAGGCCC 
m GGGTGAAGAGCGTGCGGGAGGCGGCCGAGCGCATGGCCTTCAACATGCCCGTCC 
Q 20 AGGGCACCGCCGCCGACCTCATGAAGCTGGCTATGGTGAAGCTCTTCCCCAGGCT 
O GGAGGAAATGGGGGCCAGGATGCTCCTTCAGGTCCACGACGAGCTGGTCCTCGA 
GGCCCCAAAAGAGAGGGCGGAGGCCGTGGCCCGGCTGGCCAAGGAGGTCATGG 
AGGGGGTGTATCCCCTGGCCGTGCCCCTGGAGGTGGAGGTGGGGATAGGGGAGG 
ACTGGCTCTCCGCCAAGGAGGGCATTGATGGCCGCGGCGGAGGCGGGCATCATC 
25 ATCATCATCATTAA 

SEQ ID NO: 12 The amino acid sequence of PL-ATaq fusion protein 

MITNSKKKKKKXRKKRKKKKKGGGVTSGATVKJ^KYKGEEK^VDISKIKKVWRVGK 
MISFTYDEGGGKTGRGAVSEKDAPKELLQMLEKQKKGGGVTSPKALEEAPWPPPEG 
30 AFVGFVLSRKEPMWADLLALAAARGGRVHRAPEPYKALRDLKEARGLLAKDLSVL 
ALREGLGLPPGDDPMLLAYLLDPSNTTPEGVARRYGGEWTEEAGERAALSERLFAN 
LWGRLEGEERLLWLYREVERPLSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLE 
AEVFRLAGHPFNLNSRDQLERVLFDELGLPAIGKTEKTGKRSTSAAVLEALREAHPIV 
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EKILQYRELTKLKSTYIDPLPDLIHPRTGRLHTRFNQTATATGRLSSSDPNLQNIPVRTP 
LGQRIRRAFIAEEGWLLVALDYSQIELRVLAHLSGDENLIRVFQEGRDIHTETASWMF 
GVPREAVDPLMRRAAKTINFGVLYGMSAHRLSQELAIPYEEAQAFIERYFQSFPKVR 
AWIEKTLEEGRRRGYVETLFGRRRYVPDLEARVKSVREAAERMAFNMPVQGTAAD 
5 LMKIAMVKLFPRLEEMGARMLLQVHDELVLEAPKERAEAVARLAKEVMEGVYPL 
AVPLEVEVGIGEDWLSAKEGIDGRGGGGHHHHHH 

SEQIDNO:13 PRIMER L7 IF 

5 '-CCTGCTCTGCCGCTTC ACGC-3 ' 

10 

SEQIDNO:14 PRIMER L71R 

5'-GCACAGCGGCTGGCTGAGGA-3' 

g SEQIDNO:15 PRIMER L18015F 

9 15 5'-TGACGGAGGATAACGCCAGCAG-3' 

% SEQIDNO:16 PRIMER L23474R 

r 5 '-GAAAGACGA TGGGTCGCTAATACGC-3 ' 

W 20 SEQIDNO:!? PRIMER L18015F 

5 5'-TGACGGAGGATAACGCCAGCAG-3' 

SEQ ID NO:18 PRIMER L29930R 

5 ' -GGGGTTGG AGGTC AATGGGTTC-3 ' 

25 

SEQIDNO:19 PRIMER L30350F 

5 ' -CCTGCTCTGCCGCTTC ACGC-3 ' 

SEQ ID NO:20 PRIMER L35121R 

30 5'-CACATGGTACAGCAAGCCTGGC-3' 

SEQ ID NO:21 PRIMER L2089F 

5'-CCCGTATCTGCTGGGA TACTGGC-3 
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f 

SEQ ID NO: 22 PRIMER L7112R 

5 '-CAGCGGTGCTGACTGAATCATGG-3 

SEQ ID NO:23 PRIMER L30350F 

5 5'-CCTGCCTGCCGCTTCACGC-3' 

SEQ ID NO:24 PRIMER L40547R 

5'-CCAATACCCGTTTCA TCGCGGC-3' 

10 SEQIDNO:25 PRIMER H-Amelo-Y 

5'-CCACCTCATCCTGG GCACC-3' 

Q SEQIDNO:26 PRIMER H-Amelo-YR 

g 5'-GCTTGAGGCCAACCATCAGAGC-3' 
H 15 

r; SEQ ID NO:27 Human beta-globin primer 536F 

W 5'-GGTTGGCCAATCTACTCCCAGG-3' 

i , I 
%^ 

7^ SEQ ID NO:28 Human beta-globin primer 536R 

W 20 5'-GCTCACTCAGTGTGGCAAAG-3' 

SEQ ID NO:29 Humaii beta-globin primer 1408R 

5'-GATTAGCAAAAGGGCCTAGCTTGG-3' 



53 



